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1 The Lovasz splitting-off lemma 



Lovasz's splitting-off lemma states the following. 



Theorem 1 Let G = U {s},E) he a graph such that 



V0^C/cy: d{U)>k, 



(1) 



where d{U) denotes the number of edges between U and U . and k > 2. Also, assume that d{s) (the 
degree of the vertex s) is even. Then for every (s, t) G E, there exists (s, u) € E such that the graph 
G' = (V U s,i? \ {(s,t), (s, u)} U {(t, m)}) abo satisfies the condition (1). 

Proof: Let S denote the set of neighbors of s in G (i.e., S = {u € V : {s,u) e E}). Fix a 
t G S. We would like to show that there exists a. u E S such that condition (1) holds for the graph 
G' = {V L\ s,E\ {{s, t), {s, u)} U {{t, u)}). For the sake of contradiction, assume this does not hold. 
This means that for every u G S, there exists a set J7, ^ J7 C. V, such that d{U) < k + 1 and 
u,t E U (See Figure 1). In other words, the collection of all sets U with d{U) < fc + 1 and t G U 
covers S. Let C be a collection of maximal sets U with d{U) < fc + 1 and t G U that covers S. 



For every U gC, we have d{U) < fc + 1 and d{U U {s}) > k (the latter inequality holds because 
d{U U {s}) =d{V\U)>k by (1)). Therefore, 



and so d{s, V \U) + 1 > d{s, U). On the other hand. d{s, V \ U) + d{s, U) is equal to the degree 
of s, which is an even number. Thus, d{s, V \ U) and d{s, U) have the same parity. Therefore, 
d{s, V\U) > d{s, U). In other words, d{s, U) < ^d{s). This, together with the fact that t G U for 




s 



Figure 1: A set U with d{U) < k + 1 and uGlI 



1 > d{U) - d{U U {s}) = d{s, U) - d{s, V\U) 
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Figure 2: Three sets Ui, U2, Us satisfying properties (2) 

every U G C, shows that two of the sets U E C arc not enough to cover S* (i.e., UiUUj ^ S for every 
Ui, Uj G C). Therefore, C must contain at least three sets Ui, U2, U3 such that 

t G J7i n t/2 n ?73 

Ui \ {U2 U [/3) 7^ 
U2 \ (f/i U [/a) ^ 

U3 \ {Ut U U2) ^ 0. (2) 

See Figure 2. We now use the following inequality which is a consequence of the three-way 
submodularity of the function d. 

d{Ux) + d{U2) + d{Us) > d(c/i n t/2 n t/3) 

+ d([/l\(C/2UC/3)) 

+ d{U2 \ {Ui U C/3)) 

+ d{U3\{UiUU2)). (3) 

It is straightforward to check all cases for an edge e and show that in each case, e is counted at 
least as many times on the left-hand side as it is counted on the right-hand side. This proves the 
above inequality. In fact, there is at least one edge st that is counted three times on the left-hand 
side, but only once on the right-hand side. Therefore, we can strengthen inequality (3) by adding 
a +2 to its right-hand side. Since every term on the left-hand side of (3) is at most A; -|- 1 (by the 
definition of U's) and every term on the right-hand side is at least k (by assumption (1) on the 
graph G and properties (2)), the above inequality implies: 

3A; + 3>4fc + 2=>fc<l. 
This gives us a contradiction since K was assumed to be at least 2. □ 

2 Submodular function minimization 

In the rest of this lecture, we sketch an algorithm for submodular function minimization. This is 
from Chapter 45 of Lex Schrijver's book. 
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Figure 3: The extended polymatroid EPf 



Problem Statement. Given an oracle for a function f -.2° ^ Z, find a set J7 C 5 that minimizes 
/([/) over aU subsets of S. We assume, without loss of generality, that /(0) = 0, otherwise we can 
minimize the function f{U) — /(0) instead of /. 

This problem has many applications. As an example, consider the matroid intersection problem 
that we discussed in previous lectures. We showed that the convex hull of the of the intersection of 
two matroids is the set of all vectors x such that for every U C S, x{U) < ri{U) and x{U) < r2{U), 
where ri and r2 are rank functions of the matroids. Therefore, we can optimize over the intersection 
of two matroids by solving a linear program with the above constraints. It is not obvious how to solve 
this linear program, since it is of exponential size. However, we can get polynomial-time separation 
by minimizing ri(U) — x(U) over all U C S, and chcc;king if the minimum is non-negative (for 
i = 1,2). This can be done using an algorithm that solves the submodular function minimization, 
since ri — a; is a submodular function. 

Notice that it is not obvious that minimizing a submodular function given by an oracle is possible 
in polynomial time. Clearly, without the assumption of submodularity, it is not possible to find the 
minimum of the function before calling the oracle on all 2" points on which the function is defined. 
However, in the rest of this lecture we will sketch an algorithm due to Lex Schrijver that solves this 
problem for submodular functions in polynomial time. 

We start by defining two polyhedra related to a submodular function /. The first polyhedron is 
called the extended polymatroid associated with /, and is defined as follows: 



Notice that this definition does not require a; > 0. As an example, if = {1, 2} and / is defined 
by /(0) = 0,/({l}) = l,/({2}) = -1,/({1,2}) = 0, then the extended polymatroid EPf is the 
shaded area in Figure 3. 

Prior to the algorithm of Schrijver, there was a polynomial-time (but not strongly polynomial- 
time) algorithm for submodular function minimization based on the ellipsoid algorithm and the 
polyhedron EPf. 

We define the second polyhedron, which is called the base polyhedron, as follows: 



EPf = {xGR^ : 



x{U)<f{U), \fUCS}. 



Bf = {xeR^ : 



xiS) = fiS), x{U)<f{U), V?7C5}. 
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For example, for the function in the previous example, the polyhedron Bf consists of one point 
that is marked by a cross in Figure 3. 

Our goal is the following. 
Goal. Find a set ?7 C 5 and a vector x G Bf such that 



x{v) < Vw e f/ (4) 
x{v) > yv^U (5) 
x{U) = f{U) (6) 

(7) 



Claim 2 // we can find a set U and vector x G Bf satisfying properties (4)-(6), then U is the set 
that minimizes f{U). 

Proof: This is because for every set W C S, 

f{U)=x{U)<x{W)<f{W), 

where the first equality follows from property (6), the second inequality follows from (4) and (5), 
and the third inequality is a consequence of x Cz Bf. □ 

It is not clear how one can prove that a vector x belongs to Bf, since Bf is defined by expo- 
nentially many inequalities. We do this by expressing a; as a convex combination of elements that 

are "obviously" in Bf. Such elements are defined below. In fact, these elements are extreme points 
(and the only extreme points) of Bf, but we do not need this fact in our proof. 

Choose a total order -< on S. For every v G S, we define = {w G S : w ^ v}. The vector 
6^ G is defined by 

b^{v) = f{v^U{v})-f{v^). 
Claim 3 For every total order -< on S, G Bf. 

Proof: By the definition of b^{S) is a telescopic sum that is equal to /(S*) — /(0) = f{S). Now, 
we prove that for every U C S, b'^{U) < f{U). We can prove this by induction on the size of U. If 
\U\ = 0, then the statement is trivial. Otherwise, let v be the maximal element of U (with respect 
to ^). and apply the induction hypothesis on U\{v}. This gives us b^{U\{v}) < f{U\{v}). Since 
V is the maximal element of U, we have U Q v^ U {v}. Therefore, by the submodularity of /, we 
have b'^{v) = f{v^ U {v}) — f{v^) < f{U) — f{U\{v}). By adding this inequality with the previous 
inequality we obtain b^ {U) = b^ {U \ {v}) + b-^ {v) < f{U). □ 

We will find a vector x satisfying properties (4) (6) and express the vector x as a convex com- 
bination of b'^ 's, thereby showing that x G Bf. We do this by starting from an arbitrary x that can 
be written as a convex combination of 6^ 's, and modify x (along with the expression that gives x as 
a convex combination of 6^'s) until there exists a U such that (a;, U) satisfies the desired properties. 

Suppose we have a vector x that can be written as x = X^iLi ^ib'^'-, where A; > for all i. Since 
Bf C = K" and all points in Bf must satisfy an equality, the dimension of Bf is at most n — 1, 
and hence x can be expressed as a convex combination of n extreme points. So, we assume k <n. 
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For every -< and every v € S, b^{v^) — f{v^). We call the set a prefix (also known as a lower 
ideal) of -<. Therefore, if ?7 C 5 is a prefix of -<i for every 1 < i < k, then x{U) = \b^*iU) = 
^*Lj^ Ai/(f/) = f{U). Thus, if wc can find a set U that is a prefix of -<i for every 1 < i < k 
and satisfies x{v) < for v £ U and x{v) > for v ^ U, then we are done. This motivates the 
following definition: Let D = {S, A) be a directed graph on the set of vertices S with the arc set 
A = {{u, v) : u V for some 1 <i <k}. By this definition, a set f is a prefix of every -<i if and 
only if (5*"(C/) = in D. 

Now, let V = {v : x{v) > 0} and M = {v : x{v) < 0}. We consider two cases: 

Ccise 1. D has no directed path from V to Af. In this case, let U be the set of vertices v such that 
there is a path from v to some vertex of A/". Therefore, U contains J\f but nothing from V, and 
is a prefix for every -<j. Therefore, (x, U) satisfy the properties (4) (6), and wc are done. 

Case 2. There is a directed path from V to Af. In this case, we change either x or the way x is 
expressed as a convex combination of 's. Pick s and t on the path from V to Af such that 
t G Af, s ^ Af, and there is an arc from s to t in D (details of the selection rule is omitted). 
We would like to change x or its representation to kill the path from V to Af. We can do this 
either by removing t from Af (i.e., increasing Xt), or by removing the arc {s,t) from D. The 
arc (s,i) is present because s ~<i t fov some i. We focus on one such i, and will try to get s 
closer to t in 



Figure 4: Path from V to Af and the vertices s and t 

Let X* denote the unit vector along along the coordinate t (and similarly for x"). We show 
that for some S > 0, x + i5(x* — x^) € Bf, and moreover, we can write x + S{x* — x^) as a 
convex combination in which s is closer to t. More precisely, we use the following lemma as a 
subroutine. 

Lemma 4 Given -<, s, and t, express the vector 6^ + 5(x* — x'') Jot some 5 > Q as a convex 
combination of 6^= " for u G {s,t]^ = {u : s < u < t}, where -<s,u is the total order that is 
obtained from -< by moving u before s. 

Proof: We assume that 6^ = 0. This assumption is without loss of generality, because we 
can replace f{U) by f{U) — b^{U) and apply the argument on this new function. By the 
submodularity of /, we have 

\i V ^ s ot: V )^ u 
\i s <v <u 
\iv = u. 

The following table shows the pattern of non-negative and non-positive entries in 6^*'"'s, for 
every u e {s,u]^. In this table, — denotes a non-positive entry, -|- denotes a non-negative 



b-^^'^iv) 
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entry, and denotes an entry that is zero. 



u 



s 



t 



s 



- + 
- - + 
- - - + 







t 



- - - 



- + 



If the non- negative entry (+) on one row is zero (i.e., = for some u), then all other 

entries of that row must also be zero, since the sum of the entries in each row must be the same 
as the sum of the entries in 6^ (since they are both equal to f{S)), which is zero. Therefore, 
in this case, we can take S = and use as the desired convex combination. The other 

case is when > for every u G {s,t]^. In this case, we can start from the last row of 

the table (i.e., the vector 6^= *), and for every row of the table, from the row before t to the 
row after s, add a multiple of the row to the current vector so that the + entry cancels out 
the corresponding entry in the current vector. At the end, we will obtain a vector that has 
only one negative entry at position s and one positive entry at position t. Furthermore, since 
the sum of all entries should be zero, the absolute value of these two entries are equal. This 
means that for some 6 > 0, we can write (5(x* — x^) as a convex combination of 6^= ''s. □ 

We iterate the procedure in the above lemma. Intuitively, every time we get s closer to t in at 
least one of ^i's, without changing other ^j's. We might also increase xt- Therefore, after a 
finite number of iterations, we will either remove the arc (s, t) from D, or will remove t from 
J\f. For how s and t are chosen and the analysis of the running time of this procedure, see Lex 
Schrijver's book. 
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